Identifying , Quantifying , Extracting and Enhancing Implicit Parallelism

نویسندگان

  • Mayank Agarwal
  • MAYANK AGARWAL
  • Pradeep Dubey
  • Josep Torrellas
چکیده

The shift of the microprocessor industry towards multicore architectures has placed a huge burden on the programmers by requiring explicit parallelization for performance. Implicit Parallelization is an alternative that could ease the burden on programmers by parallelizing applications “under the covers” while maintaining sequential semantics externally. This thesis develops a novel approach for thinking about parallelism, by casting the problem of parallelization in terms of instruction criticality. Using this approach, parallelism in a program region is readily identified when certain conditions about fetch-criticality are satisfied by the region. The thesis formalizes this approach by developing a criticality-driven model of task-based parallelization. The model can accurately predict the parallelism that would be exposed by potential task choices by capturing a wide set of sources of parallelism as well as costs to parallelization. The criticality-driven model enables the development of two key components for Implicit Parallelization: a task selection policy, and a bottleneck analysis tool. The task selection policy can partition a single-threaded program into tasks that will profitably execute concurrently on a multicore architecture in spite of the costs associated with enforcing data-dependences and with task-related actions. The bottleneck analysis tool gives feedback to the programmers about data-dependences that limit parallelism. In particular, there are several “accidental dependences” that can be easily removed with large improvements in parallelism. These tools combine into a systematic methodology for performance tuning in Implicit Parallelization. Finally, armed with the criticality-driven model, the thesis revisits several architectural design decisions, and finds several encouraging ways forward to increase the scope of Implicit Parallelization.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine-Independent Evaluation of Parallelizing Compilers

A method is presented for measuring the degree of success of a compiler at extracting implicit parallelism. The outcome of applying this method to evaluate a state-of-the-art par-allelizer, KAP/Concurrent, using the Perfect Benchmarks and a few linear-algebra routines indicates that there is much room for improvement in the current generation of parallelizing compilers.

متن کامل

Vidushi: Parallel Implementation of Alpha Miner Algorithm and Performance Analysis on CPU and GPU Architecture

Process Mining consists of extracting valuable information from event logs produced by Process Aware Information Systems (PAIS) which support business processes and generate event logs as a result of execution of the supported business processes. Alpha Miner is a popular algorithm in Process Mining which consists of discovering a process model from the event logs. Discovering process models fro...

متن کامل

Identifying a Company's Non-Customer Online Communities: a Proto-typology

This paper describes an exploratory, qualitative study of a process for extracting, identifying and exploiting an enterprise's implicit (less visible) web communities using link analysis. By identifying the implicit communities' relationships to a specific enterprise, the information can be organized into stakeholder groups and used for 'listening' to the external environments. A prototypology ...

متن کامل

Implicit Finite Element Applications: A Case for Matching the Number of Processors to the Dynamics of the Program Execution

Generally, parallel scienti c applications are executed on a xed number of processors determined to be optimal by an e ciency analysis of the application's computational kernel. It is well-known, however, that the degree of parallelism found in di erent parts of an application varies. In this paper, we present the results of an in-depth study quantifying the advantages of matching the number of...

متن کامل

Implicit Schemes and Parallel Computing in Unstructured Grid Cfd

The development of implicit schemes for obtaining steady state solutions to the Euler and Navier-Stokes equations on unstructured grids is outlined. Applications are presented that compare the convergence characteristics of various implicit methods. Next, the development of explicit and implicit schemes to compute unsteady flows on unstructured grids is discussed. Next, the issues involved in p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009